Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine


Similar resources

Hyperdrive: A Systolically Scalable Binary-Weight CNN Inference Engine for mW IoT End-Nodes

Deep neural networks have achieved impressive results in computer vision and machine learning. Unfortunately, state-of-the-art networks are extremely compute- and memory-intensive, which makes them unsuitable for mW-devices such as IoT end-nodes. Aggressive quantization of these networks dramatically reduces the computation and memory footprint. Binary-weight neural networks (BWNs) follow this tren...

A CNN Universal Chip in CMOS Technology

This paper describes the design of a programmable Cellular Neural Network (CNN) chip, with added functionalities similar to those of the CNN Universal Machine. The prototype contains 1024 cells and has been designed in a 1.0 μm n-well CMOS technology. Careful selection of the topology and design parameters has resulted in a cell density of 31 cells/mm² and around 7-8 bits accuracy in the weight...

Chipmunk: A Systolically Scalable 0.9 mm2, 3.08 Gop/s/mW @ 1.2 mW Accelerator for Near-Sensor Recurrent Neural Network Inference

Recurrent neural networks (RNNs) are state-of-the-art in voice awareness/understanding and speech recognition. On-device computation of RNNs on low-power mobile and wearable devices would be key to applications such as zero-latency voice-based human-machine interfaces. Here we present CHIPMUNK, a small (<1 mm²) hardware accelerator for Long-Short Term Memory RNNs in UMC 65 nm technology capable to...

Architecture Design of a Scalable Single-Chip Multi-Processor

Now that system-on-chip technology is emerging, single-chip multi-processors are becoming feasible. A key problem in designing such systems, however, is the complexity of their interconnect and memory architecture [1]. An example of a single-chip multi-processor for real-time (embedded) systems is the Multi Micro Processor (MμP). Its architecture consists of a scalable number of identical master p...

Stochastic Inference for Scalable Probabilistic Modeling of Binary Matrices

Fully observed large binary matrices appear in a wide variety of contexts. To model them, probabilistic matrix factorization (PMF) methods are an attractive solution. However, current batch algorithms for PMF can be inefficient because they need to analyze the entire data matrix before producing any parameter updates. We derive an efficient stochastic inference algorithm for PMF models of fully...

Journal

Journal title: IEEE Journal on Emerging and Selected Topics in Circuits and Systems

Year: 2019

ISSN: 2156-3357, 2156-3365

DOI: 10.1109/jetcas.2019.2905654